Self-organized Emergence of Navigability on Small- World Networks 
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This paper mainly investigates why small-world networks are navigable and how to navigate 
small-world networks. We find that the navigability can naturally emerge from self-organization in 
the absence of prior knowledge about underlying reference frames of networks. Through a process 
of information exchange and accumulation on networks, a hidden metric space for navigation on 
networks is constructed. Navigation based on distances between vertices in the hidden metric space 
can efficiently deliver messages on small-world networks, in which long range connections play an 
important role. Numerical simulations further suggest that high cluster coefficient and low diameter 
are both necessary for navigability. These interesting results provide profound insights into scalable 
routing on the Internet due to its distributed and localized requirements. 
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I. INTRODUCTION 



Small-world (SW) networks are ubiquitous in nature 
and society. Travers and Milgram discovered small- world 
phenomenon through delivering letters among people in 
late 1960s In the experiment, each participant could 
only deliver letters to a single acquaintance who was more 
possible to deliver letters to target persons based on their 
own judgement. Relying on this greedy routing strat- 
egy, or so-called navigation, at last 29 percent of letters 
reached target persons and the average length of acquain- 
tance chains of letters that were successfully sent was 6. 
Recent experiments have proved that the greedy rout- 
ing strategy could efficiently pass messages on email net- 
works and online social service networks 0-3. These 
striking results suggest that people are connected with 
much shorter chains than our imagination and they can 
find the short paths based solely on local information, re- 
gardless of the network size and the topological distances 
among people. 

Navigability of SW networks has gained tremendous 
interests of scientists. A variety of models have been 
proposed to explain the underlying mechanisms that en- 
sure findingshortest paths based exclusively on local in- 
formation [5|-l7|. In these models, networks were gener- 
ated based on underlying reference frames, e.g. grids, 
hierarchy, and hyperbolic spaces, which determined how 
networks were organized. Vertices were contained in the 
underlying reference frames which provided definitions 
of distances between vertices, and adjacent vertices were 
more likely to be connected. Navigation was modeled by 
greedy routing: messages were sent to one neighbor near- 
est to the target in underlying reference frames, which 
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was efficient for passing messages if vertices were aware of 
positions of its neighbors and targets. Indeed, the afore- 
mentioned works suggest that networks act as an overlay 
on underlying reference frames during navigation. There- 
fore, navigability of networks are based on the fact that 
the underlying reference frames are navigable. In these 
models, efficient navigation needs prior knowledge about 
organization of networks. However, several real large-size 
networks, e.g. email networks and online social service 
networks, are self-organized, so that it's hard for indi- 
viduals to be aware of underlying reference frames and 
discover their exact positions. 

Here, we aim to address the navigability of SW net- 
works through a different way: establishing a general 
scheme for efficient navigation, regardless of the under- 
lying reference frames of networks. This kind of prob- 
lems have been considered before. One method is to re- 
construct underlying reference frames, e.g. embedding 
networks generated by Kleinberg's model into Euclidean 
plane and reconstructing the dimension of the underlying 
lattice when network generated by long-range percolation 
[1, 1^ . The other method is to embed a network into a 
metric space and ensure that distances between vertices 
are proportional to shortest path length through a proper 
embed ding alg orithm, regardless of underlying reference 
frames [iMl- 

We construct a scheme for navigation following the 
idea of metric space. The embedding algorithm is in- 
spired by the fact that navigation on social networks 
is based on information exchanged and accumulated by 
communication, which is used to determine who among 
the acquaintances are 'socially closest' to target persons. 
Therefore we embed networks into a metric space through 
a process of information exchange and accumulation, in 
which vertices find their positions by distributed and lo- 
calized self-organization. It is demonstrated by numerical 
simulations that the self-organized algorithm can estab- 
lish a scheme for efficient navigation, irrespective of the 
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underlying reference frames of networks, and we find that 
the navigability of networks is influenced by SW proper- 
ties which are characterized by low diameter and high 
cluster coefficient. 



where di is the degree of vertex i. Equations (|4]) and ([5]) 
can be rewritten in matrix form as the combinations of 
eigenvectors of normal matrix N of network 



Po = Xo = VA, 



(6) 



II. ALGORITHM TO ESTABLISH 
NAVIGATION SCHEME 



Xf = NXt_i = N*Xo = VD*A, 



(7) 



The key for addressing the navigability lies in the self- 
organized embedding algorithm in the absence of prior 
knowledge about underlying reference frames. In our al- 
gorithm, an m-dimension Euclidean space is chosen as 
the metric space to define distances between vertices. 
Then we follow the self-organized process of information 
exchange and accumulation on social networks, which is 
described as follows 



Pi t = Pj\t-1 + X; 



(1) 



(2) 



where is the set of immediate neighbors of vertex i. 
Vectors x and p consist of m elements corresponding to 
the m dimensions of metric space. The vector x is cou- 
pled through the network topology and simultaneously 
updated according to Eq. ([T]), while position vector p is 
the cumulative summation of historical vector x. Since 
information exchange in Eq. ([T|) is restricted between 
vertices and their direct neighbors, the algorithm is dis- 
tributed and localized. Meanwhile, distances between 
vertices will be constant if vector x can converge after 
sufficient evolving steps. Moreover, vertices can be seen 
as flocking in a metric space, and vectors x and p rep- 
resent velocity and position, like in Vicseck model |13| . 
Velocities of tightly connected vertices synchronize more 
quickly. Therefore, vertices connected by shorter paths 
will gather in the metric space, which ensures that the 
distances between vertices in the metric space are asso- 
ciated with path lengths on networks. Messages can be 
delivered along short paths by navigation based on dis- 
tances in the metric space. 

Many dynamics can be applied as a realization of 
Eq. (jlj, such as chaotic oscillators coupled by networks 
which can synchronize depending on suitable coupling 
strengths. For the purpose of its simplicity, we choose 
the updating rule of vector x as follows: at every time 
step, values of x^ is the average of its neighbors and the 
initial Pi^o equals to x^^o- Then the algorithm can be 
written as 



Po = xo. 



Pi,t — Pi,t-1 



(3) 



(4) 



(5) 



Pt = Pt-i + Xt = V(I + ^ D^)A. 



(8) 



Every row of matrices X and P is the vector of velocities 
and positions of each vertex. Columns of matrix V are 
the eigenvectors of N. A consists of linear combination 
coefficients when eigenvectors of N are chosen as basis 
vectors. Matrix D is a diagonal matrix with eigenvalues 
of normal matrix on the main diagonal. Because eigen- 
values of N are in the interval [—1,1], for long enough 
evolving time t, we get the final position matrix P as 



YEA. 



(9) 



Matrix E is a diagonal matrix whose ith diagonal ele- 
ment is 1/(1 — Xi), where is the eigenvalue of normal 
matrix. It can be seen that eigenvectors corresponding 
to large eigenvalues play more important roles in the po- 
sition matrix as a result of the factor 1/(1 — Xi). 

Since the positions of vertices in the metric space are 
linear combinations of eigenvectors of normal matrix, it 
demonstrates that the embedding can represent network 
topology, which is reflected by the fact that adjacent ver- 
tices in the metric space are connected by shorter paths 
on network. The distance between vertex i and j after 
sufficient evolving time is 
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(10) 



where ai,j and u^j are the elements of matrix A and 
V, respectively. If elements of Xq are uniformly dis- 
tributed in interval [—1,1], the elements of matrix A 
have the following properties: (flij) = 0, {ai,jak,i) — 
and {of j) — {x^). In addition, if m is sufficiently large, 
the distance can be expressed by 

m{x'^) 



T- 



(11) 



Equation (|lip shows that the distances between vertices 
can be seen as those in the situation that position values 
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of vertices are elements of weighted eigenvectors of nor- 
mal matrix. Due to the factor (1 — A^)"^, distances are 
mostly determined by eigenvectors associated with large 
eigenvalues. It has been proved that these eigenvectors 
are the solutions of following constrained optimization 
problem Let the energy of system z{x) be defined 



(12) 



where L is the Laplace matrix of network and x are posi- 
tion values assigned to the vertices together with a con- 
straint 



x'Kx = 1, 



(13) 



where matrix K is a diagonal matrix whose ith main di- 
agonal element is the degree of vertex i. Let Ai < A2 < 
. . . < A„_i < A„ = 1 be the eigenvalues, and the corre- 
sponding eigenvectors under constraint of Eq. (jl3p are 
vi,V2, ■ ■ ■ , Vn-i and w„. The minimum nontrivial value of 
z is 1 — A„_i, and the relevant position vector x is 
If the energy reaches the minimum nontrivial value, ver- 
tices which are connected by a number of short paths 
are sufficiently close in the metric space constructed by 
eigenvectors, which ensures that distances in the metric 
space correspond to path lengths on networks. 

Due to the fact that similar vertices are more likely to 
be connected, it's natural to evaluate similarities based 
on the number of paths between vertices and the length 
of paths in the absence of prior knowledge of underlying 
reference frames [lB|- Through this evaluation, vertices 
connected by more and shorter paths, which will be adja- 
cent in the metric space after self-organized embedding, 
are deemed to be more similar. Therefore, the results 
of embedding algorithm are in consistent with the basic 
ideas of underlying reference frames: similar vertices are 
adjacent, and more likely to be connected. 



III. EXPERIMENTAL RESULTS 

A. Experimental Results of Small- World Networks 
Generated by WS Model 

The self-organized embedding algorithm is applied to 
build navigation scheme on SW networks generated by 
WS (Watts-Strogatz) model, in which SW properties re- 
sult from rewiring edges of original regular network at 
probability p p^ . The chosen original regular network 
has n=1000 vertices, and each vertex link to fc=10 near- 
est others. The diameters and cluster coefficients of net- 
works at different rewiring probabilities are shown in Fig. 
1(a) Experimental results are averaged over 20 network 
realizations. As shown in Fig l(a)[ even for the small 



rewiring probability, the diameters of networks decrease 
sharply while the cluster coefficients are nearly the same 
as the original regular network. 



At the beginning of embedding algorithm, every vertex 
is assigned an initial velocity Xi^Oj whose values of each 
dimension are uniformly distributed within [—0.5,0.5]. 
Dimensions of metric spaces chosen to be to = 5, 10 and 
20 are to investigate how the metric space influences nav- 
igation. The embedding algorithm is terminated when 
the velocities of vertices reaches a certain synchroniza- 
tion level. We defined the synchronization error of x^ of 
dimension k at evolving time t as 



et(fc) 
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(14) 
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When the synchronization errors of velocities at each di- 
mension are less than a small value, which is chosen as 
lO"**, the embedding algorithm is terminated. 

The greedy routing strategy to simulate navigation on 
networks can be described as follows: vertices are aware 
of positions of their neighbors in the metric space and po- 
sitions of targets are transmitted by messages. Messages 
are passed through current hop to the neighbor closest 
to targets at each step. To avoid loops, messages are 
prohibited from neighbors that have been visited. The 
routing will terminate if message reaches target or all the 
neighbors of current hop have been visited. We randomly 
pick 10^ source and target pairs for every network to be 
navigated. Notice that the navigation is not symmetric, 
e.g. navigation from vertex i to j is not equivalent to 
navigation from j to i because the local environments of 
vertex i and j are different. Efficient navigation is de- 
fined by the fact that messages are successfully passed 
to targets along the shortest paths. Therefore, we exam- 
ine two metrics to evaluate navigability: the successfully 
routing rate (the ratio of number of successfully routing 
messages and all messages) and the stretch (average of 
the ratios of routing path length and shortest path length 
of each message). 

Figure 1(b), (c) and (d) show that the successfully 
routing rates and stretches are as a function of rewiring 
probability p for the hidden metric space of different di- 
mension. When rewired connections start to arise, suc- 
cessfully routing rates increase quickly, whereas stretches 
grow much slower until cluster coefficients drop sharply. 
As a result of the different growing speeds, high success- 
fully routing rates and low stretches, which indicates the 
efficient navigation and strong navigability, simultane- 
ously occur when the networks show small- world proper- 
ties, and are much more apparent for the hidden metric 
space of larger dimensions. In other words, the larger 
dimension of hidden metric space is useful to improve 
performances of navigation, which is reflected by higher 
successfully routing rates and lower stretches at the same 
rewiring probability. 

Long range connections, or the so-called weak ties in 
sociology, play an important role in activities on net- 
works, e.g. information which people receive through 
weak ties is more useful and successfully routing messages 
on Email networks are conducted primarily through in- 
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FIG. 1: (Color online) Diameter and cluster coefficient as a function of rewiring probability p (a); performance of greedy routing 
for different dimension of metric space (b) m = 5, (c) m = 10 and (d) m = 20. Networks are generated by WS model 1^. 
Numerical simulations at each p are averaged over 20 realizations of model. SW networks show strong navigability with high 
successfully routing rate and low stretch for all dimensions. In particular, the SW properties are necessary for navigability, and 
the large metric space dimension is useful to improve navigability. 



termediate to weak strength ties [2, ll7[ . Hence it is wor- 
thy of studying that long range connections affect nav- 
igation by passing messages to vertices far away from 
each other on networks. We calculate the distributions 
of shortest path length between all pairs and success- 
fully routed pairs at different rewiring probability p when 
the metric space dimension is 20 (see in Fig. When 
there are fewer long range connections, navigation finds 
more nearby targets than those far away. The reason is 
that many messages can't travel far away from the start 
vertices on networks with high cluster coefficients result- 
ing from quickly arriving at the vertices whose neighbors 
have been all visited. As the number of long range con- 
nections increases, messages can escape from the local 
area of source vertices and travel a long distance on net- 
works to arrive at targets. Therefore, targets are suc- 
cessfully reached at the same probability for different 
path length from sources. Moreover, the two distribu- 



tions have agreed well with each other when the rewiring 
probability is 0.0008, which results in sufficiently small 
number of long-range connections compared to the total 
number of connections. The weak ties are extremely use- 
ful in the sense that even if a few long range connections 
exist, messages could be passed to the whole network. 
This fact also explains why the successfully routing rates 
immediately increase fast when there are only a few long 
range connections. 

Navigability of networks in terms of self-organized em- 
bedding algorithm is based on the fact that distances in 
the metric space are associated with similarities of ver- 
tices extracted from topology. However, we can't ensure 
that distance between every vertex pairs represents its 
similarity in the absence of central control, e.g. adjacent 
vertices in the metric space may not be tightly connected. 
Actually, greedy routings performed on networks consist 
of two parts: properly directed part is more relevant to 
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FIG. 2: (Color online) Distributions of shortest path length between vertices of all routing messages and successfully routing 
messages for different rewiring probability p:(a) p = 0.0001, (b) p — 0.0002, (c) p — 0.0004 and (d) p — 0.0008. As the number 
of long range connections increases, messages can escape from the local area of source vertices and travel a long distance. The 
two distribution have agreed well with each other even when the rewiring probability is still very small. These results reflects 
the power of weak ties: even if a few long range connections exist, messages could be passed to the whole network. 



navigation, while the remaining part is more tendency to 
random walks. For numerical simulations on SW net- 
works, when cluster coefficients stay at a high value, 
there are clusters consisting of tightly connected similar 
vertices, which satisfies the organizing rules of networks 
based on underlying reference frames. In this case, net- 
work topology can be mapped into a hidden metric space 
by self-organized embedding algorithm. Meanwhile, mes- 
sages cannot travel along paths through random walk on 
networks with high cluster coefficients because they are 
easy to reach a vertex, all of whose neighbors have been 
visited. Successfully passed messages on highly clustered 
networks are mostly routed by navigation, which leads 
to low stretches. When cluster coefficients drops quickly, 
the local clusters vanish by randomly rewired connec- 
tions and vertices are randomly placed in the hidden 
metric space, which differs the embedding of networks 
from the network topology. In this regard, random walks 



can travel a long path to reach targets because vertices 
have little common neighbors. Therefore, most messages 
are successfully delivered by random walks, which leads 
to large stretches and the successfully routing rates are 
close to 1. 



B. Experimental Results of Small- World Networks 
with Power-Law Degree Distribution 

Many real SW networks have the power-law degree dis- 
tribution p{k) ~ k~'^ , such as the Internet and WWW. 
They are called scale- free networks in which there are ver- 
tices with much larger degrees than randomly connected 
networks, such as ER (Erdos-Renyi) model. The largest 
degree of scale-free network is proportional to N'^/('^~^\ 
where N is the number of vertices in networks. The BA 
(Barabasi-Albert) model has been proposed to explain 
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FIG. 3: (Color online) Diameter and cluster coefficient as a function of exponent 7 (a); performance of greedy routing for 
different dimension of metric space (b) m = 5, (c) m = 10 and (d) m = 20. Networks are generated by the generalized BA 
model [H, 20]. Experimental results at each 7 are averaged over 20 realizations of the model. Scale- free networks with small 
exponent 7 show strong navigability represented by high successfully routing rate and low stretch for all dimensions. Like the 
results of SW models, both high cluster coefficient and low diameter are also necessary for navigability, and the large metric 
space dimension is also helpful to improve navigability. 



the emergence of power-law degree distributions based on 
the ideal of preferential attachment [l^ . We also investi- 
gate the navigability of scale-free networks generated by 
the generalized B A model jTol . [20j . In this model, a ver- 
tex is added in the network with m connections at each 
step. The probability of attaching to an existing vertex 
of degree k is proportional to k + ko, where the offset fco 
is a constant. Note that kg being larger than —m is to 
ensure positive probabilities. This model yields a power- 
law degree distribution with exponent 7 = 3 -|- fco/m. 
Negative values of fco lead to exponent less than 3, which 
has been observed in many real complex networks. 

The scale- free networks consist of 10'^ vertices together 
with m = 3 and the offset fco being turnable to get expo- 
nent 7 from 2 to 4. Results for different 7 are averaged 
over 20 networks. Small-world properties of scale-free 



networks are shown in Fig. 3(a) It is interesting that the 



cluster coefficients of scale- free networks quickly decrease 
with the growth of exponent, meanwhile the diameters 
only increase a little. Thus, the scale-free networks with 
small exponents exhibit strong SW properties. We con- 
struct a metric space and exactly execute greedy routing 
similar to SW models. Figure 3(b), (c) and (d) show the 
performance of navigation for different exponent 7. It 
demonstrates that when networks exhibit SW properties 
with small 7, strong navigability emerges. Stretches are 
also affected by cluster coefhcients because topology of 
highly clustered networks can be more properly mapped 
into a metric space. In addition, it can be seen that the 
high degree nodes act as hubs in navigation on scale-free 
networks Q- Therefore, as 7 increases, successfully rout- 
ing rates slightly drop because the highest degrees de- 
crease. However, when cluster coefficients continuously 
decrease, successfully routing rates start to increase be- 
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cause most messages are passed by random walks, which 
also lead to large stretches. 

IV. CONCLUSION 

In conclusion, we have well investigated the self- 
organized emergence of navigability on SW networks via 
mapping network topology into an Euclidean hidden met- 
ric spaces through a simple embedding algorithm inspired 
by information exchanging and accumulating and estab- 
lished in the absence of prior knowledge of underlying 
reference frames of networks. It has been demonstrated 
that high navigability emerges only if networks exhibit 
strong small-world properties. Because of the lacking of 
prior knowledge of underlying reference frames, the self- 
organized embedding algorithm can establish navigable 
scheme for different kinds of SW networks, which is sup- 
ported by the results of SW networks generated by WS 
model and BA model. 

Underlying reference frames, in which similar vertices 
are adjacent and connected at higher probability, explain 
how real complex networks are organized based on sim- 
ilarities between individuals. Since the clustering ten- 
dency of small-world networks satisfies the preferential 
attachment in underlying reference frames, the hidden 
metric space based on vertices similarities can be estab- 
lished by a universal algorithm, regardless of the explicit 
organizing pattern of networks. 

The self-organized navigation may be a possible ap- 



proach available for scalable routing on the Internet, 
which has gained lots interests recently. Many algorithms 
have been proposed to reduce the storage space of rout- 
ing table without remarkable increase of routing path 
lengths, e.g. the compact routing schemes [21-23]. The 
size of routing table could be reduced to polylogarith- 
mic of the network size in compact routing with stretch 
smaller than 3, yet global topology and central control 
required to build routing scheme in these algorithm have 
to demand large amount of communications on networks. 
In our work, since the constructing hidden metric space 
and greedy routing are distributed and localized in a self- 
organized way, communication are restricted between im- 
mediate connected vertices. Meanwhile, the sizes of rout- 
ing tables are the degrees of vertices, and stretches are 
quite small when the networks show small-world prop- 
erties. Comparing with previous work on navigation [5|- 
, our work may provide profound insights into scalable 
routing scheme through a self-organized way in the ab- 
sence of prior knowledge. 
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